Predicting how it sounds: re-ranking dialogue prompts based on TTS quality for adaptive spoken dialogue systems

نویسندگان

Cédric Boidin

Verena Rieser

Lonneke van der Plas

Oliver Lemon

Jonathan Chevelu

چکیده

This paper presents a method for adaptively re-ranking paraphrases in a Spoken Dialogue System (SDS) according to their predicted Text To Speech (TTS) quality. We collect data under 4 different conditions and extract a rich feature set of 55 TTS runtime features. We build predictive models of user ratings using linear regression with latent variables. We then show that these models transfer to a more specific target domain on a separate test set. All our models significantly outperform a random baseline. Our best performing model reaches the same performance as reported by previous work, but it requires 75% less annotated training data. The TTS re-ranking model is part of an end-to-end statistical architecture for Spoken Dialogue Systems developed by the ECFP7 CLASSIC project.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Predicting and Adapting to Poor Speech Recognition in a Spoken Dialogue System

Spoken dialogue system performance can vary widely for different users, as well for the same user during different dialogues. This paper presents the design and evaluation of an adaptive version of TOOT, a spoken dialogue system for retrieving online train schedules. Adaptive TOOT predicts whether a user is having speech recognition problems as a particular dialogue progresses, and automaticall...

متن کامل

Response Generation Based on Hierarchical Semantic Structure with POMDP Re-ranking for Conversational Dialogue Systems

Conversational spoken dialogue systems can assist individuals to communicate with machine to obtain relevant information to their problems efficiently and effectively. By referring to relevant response, individuals can understand how to interact with an intelligent system according to recommendations of dialogue systems. This work presents a response generation based on hierarchical semantic st...

متن کامل

Toward Construction of Spoken Dialogue System that Evokes Users' Spontaneous Backchannels

This paper addresses a first step toward a spoken dialogue system that evokes user’s spontaneous backchannels. We construct an HMM-based dialogue-style text-to-speech (TTS) system that generates human-like cues that evoke users’ backchannels. A spoken dialogue system for information navigation was implemented and the TTS was evaluated in terms of evoked user backchannels. We conducted user expe...

متن کامل

Robust and adaptive architecture for multilingual spoken dialogue systems

We present how robustness and adaptivity can be supported by the spoken dialogue system architecture. AthosMail is a multilingual spoken dialogue system for e-mail domain. It is being developed in the EU-funded DUMAS project. It has flexible system architecture supporting multiple components for input interpretation, dialogue management and output generation. In addition to language differences...

متن کامل

MIMIC: An Adaptive Mixed Initiative Spoken Dialogue System for Information Queries

This paper describes MIMIC, an adaptive mixed initiative spoken dialogue system that provides movie showtime information. MIMIC improves upon previous dialogue systems in two respects. First, it employs initiative-oriented strategy adaptation to automatically adapt response generation strategies based on the cumulative effect of information dynamically extracted from user utterances during the ...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2009

Predicting how it sounds: re-ranking dialogue prompts based on TTS quality for adaptive spoken dialogue systems

نویسندگان

چکیده

منابع مشابه

Predicting and Adapting to Poor Speech Recognition in a Spoken Dialogue System

Response Generation Based on Hierarchical Semantic Structure with POMDP Re-ranking for Conversational Dialogue Systems

Toward Construction of Spoken Dialogue System that Evokes Users' Spontaneous Backchannels

Robust and adaptive architecture for multilingual spoken dialogue systems

MIMIC: An Adaptive Mixed Initiative Spoken Dialogue System for Information Queries

عنوان ژورنال:

اشتراک گذاری